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Abstract. We prove that the homogeneous hierarchical Anderson model 
exhibits a Lifshits tail at the upper edge of its spectrum. The Lifshits 
exponent is given in terms of the spectral dimension of the homogeneous 
hierarchical structure. Our approach is based on Dirichlet-Neumann 
bracketing for the hierarchical Laplacian and a large-deviation argu- 
ment. 
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^ ' 1. Introduction 

Hierarchical models have a long tradition in statistical physics. Dyson |D1( 



ID2j introduced them as an auxiliary tool in his study of phase transitions 
in the one-dimensional Ising ferromagnet with long-range interactions. An 
important feature of hierarchical models is that they preserve their structure 
under renormalisation-group transformations. Bleher and Sinai |B1S1[ [B1S2 
exploited this to determine critical properties of hierarchical spin models. 

Bovier [Boj seems to be the first who studied the hierarchical Ander- 
son model, that is, the Anderson model on a countably infinite configuration 
space with kinetic energy given by the hierarchical Laplacian. He also pur- 
sued a renormalisation-group approach and showed analyticity properties of 
the density of states. Under very mild conditions, Molchanov |Mol[ IMo2j 
established that the hierarchical Anderson model with Cauchy-distributed 
random variables has only pure point spectrum. In particular, his result does 
not require a homogeneous hierarchical structure. More recently, Kritchevski 
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|Krli IKr2[ IKr3| continued the investigations of the hierarchical Anderson 
model. In |Krl[ IKr2j he removed the requirement for a Cauchy distribution 
and proved Anderson localisation at all energies and for general single-site 
distributions. However, his proof only works for homogeneous hierarchical 
structures with spectral dimension d s ^ 4 (the spectral dimension will be 
introduced in Definition 12.41 below). In |Kr3] he proved Poisson statistics of 
rescaled eigenvalue distributions for the homogeneous hierarchical Anderson 
model with d s < 1. If the hierarchical Laplacian is modified as to contain 
also suitable negative hopping rates, then Monthus and Garel [MoGj argue 
in favour of a localisation-delocalisation transition in the hierarchical Ander- 
son model. Fyodorov, Ossipov and Rodriguez found analytical and numerical 
evidence for a localisation-delocalisation transition for random matrices with 
a hierarchical structure of the correlations between the matrix entries |FOR| . 

In this paper we prove the occurrence of a Lifshits tail at the upper 
spectral edge of the hierarchical Anderson model. More precisely, we only 
deal with homogeneous hierarchical structures for which one can define a 
spectral dimension d s (but we do not impose any restriction on the value of 
d s ). In particular, we find that the Lifshits exponent of the integrated density 
of states of the hierarchical Anderson model coincides with d s /2. Technically, 
we follow the approach that was successfully employed for Poisson and alloy- 
type random Schrodinger operators by Kirsch and Martinelli |KiMa| , and for 
the Anderson model on the lattice by Simon [S], see also the recent survey 
by Kirsch [Ki] . The method requires Dirichlet-Neumann bracketing for finite- 
volume operators which we establish for suitable finite- volume restrictions of 
the hierarchical Laplacian. 

The quantity d s /2, which we find for the Lifshits exponent in Theo- 
rem 12.61 is also referred to as the van Hove exponent, since it governs the 
van Hove "singularities" of the integrated density of states in the absence 
of disorder, see also Lemma T2.5I Amazingly, the equality of the Lifshits and 
van Hove exponent is known to hold for very different types of random hop- 
ping models. First and foremost we mention the standard alloy-type random 
Schrodinger operators in R d or Z d , rf e N. In that case d s — d, and the 
Lifshits exponent equals d/2, see e.g. |KiMa( [5] and references therein. But 
it also holds for the integrated density of states of the Dirichlet Laplacian 
on percolation subgraphs. This was first shown for bond percolation on Z d 
in |KiMii[ IMiiSlj and then generalised to a large class of Cayley graphs in 
|AVj . see also the recent review |MuS2j . From this perspective, Theorem 12.61 
is interesting because it establishes the equality of the Lifshits and van Hove 
exponent for a random perturbation of the rather peculiar hierarchical Lapla- 
cian. The deeper reason behind it in all mentioned models is the intuition 
explained in Remark 12.71 

The paper is organised as follows. After introducing our notation and 
presenting the main result, Theorem 12. 6[ in Sect. [21 we turn to Dirichlet- 
Neumann bracketing for hierarchical finite-volume operators in Sect. El Fi- 
nally, Sect. 2] is devoted to the proof of Theorem 12.61 and the appendix 
compiles the ergodic structure of the hierarchical Anderson model. 
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2. Model and result 

We consider a countably infinite configuration space X and the quantum 
Hamiltonian 

H^-.^A + V" (2.1) 

acting on the Hilbert space £ 2 (H) of complex- valued, square-summable se- 
quences over X. The Hamiltonian describes diagonal disorder through its 
potential energy, which acts as the multiplication operator 

(V u i/))(x) := uj x ip(x) for all ip e ^ 2 (X) and all i£i (2.2) 

Here, uj :— (uj x ) x< =x is a family of independent and identically distributed 
(i.i.d.), real- valued random variables. We think of them as being canonically 
realised in the probability space £1 := M x , equipped with the product Borcl 
cr-algebra (S>zex^ R anc ^ the P r °duct probability measure P := (S^ex^o- 
We assume throughout that the single-site distribution Po is compactly sup- 
ported, suppPo C for some 6 M, in order to avoid irrelevant 
technical complications in dealing with unbounded operators. For the proof 
of the Lifshits tail in Theorem 12.61 we suppose in addition that Po is not 
concentrated at one single point, i.e. 

P (W) < 1 for every v e R, (2.3) 

and that its upper tail decays no faster than any power, i.e. there exist real 
constants C, \i > such that 

P ([v+ -e,v+]) >Ce» (2.4) 

for every e > sufficiently small. 

The operator A in (12.11) is the hierarchical Laplacian (12.71) and refers to 
a hierarchical structure on X, which we need to explain first. A hierarchical 
structure on X is a sequence of partitions (V r ) r eN of X together with a 
sequence (n r ) r( zm of natural numbers such that the properties (HI) - (H3) 
below hold. By definition, each partition subdivides X into mutually disjoint 
subsets, which we call clusters. The clusters of V r are referred to as clusters 
of rank r. 

(HI) Vo is the trivial partition, the clusters of which consist precisely of the 

single elements of X. 
(H2) Every cluster of rank r s N is a union of n r distinct clusters of rank 

r- 1. 

(H3) Given i,i/£l there is a cluster of some rank containing both x and y. 

We denote by Q r (x) the unique cluster of rank r containing x G X, and we 
write \A\ for the number of elements of a finite set A. By (H2) the number 
of elements \Q r (x)\ — rir'=i nr ' °f tms duster does not depend on x E X. 
Thus, we will simply write \Q r \ for the cluster size. The elements of X can 
be enumerated 

No^X, k^x k , (2.5) 
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Xo X\ X2 X3 X4 Xf, Xf) X7 Xg Xg X\o Xll 

Figure 1. Sketch of a hierarchical structure for n\ — 
3, ri2 = 2,713 = 2 up to the rank-3-cluster containing xq. 
A function taking on the values ±<z € C \ {0}, when re- 
stricted to the clusters Qi(xo), Qi(x3), and being equal to 
zero everywhere else is an eigenfunction of the hierarchical 
Laplacian A corresponding to the eigenvalue Ai. 



in such a way that x^ and Xk 2 belong to the same cluster of rank r if and 
only if there exists M £ N with k\, k 2 € {M\Q r \, . . . , (M + l)\Q r \ - 1}, see 
Fig.ffl 

A hierarchical structure is called homogeneous of degree n € N, if 

n r — n for all r £ N. (2.6) 

In this case, the size of any cluster of rank r is given by \Q r \ — n r . 

Given a sequence of probability weights (p s ) se N, < p s < 1 for all s £ N 
and p s — 1, the hierarchical Laplacian is defined as the weighted sum 

00 

A:=^p s E s (2.7) 

s=l 

of the cluster averaging operators E s : ^ 2 (X) — > £ 2 (X.), 

(E s ^)(x):=^- J2 (2-8) 

where ip £ ^ 2 (X), x £ X and s 6 Ho. For convenience, we also introduce 
Po := 0. Note that Eo = 1, the identity operator, and that p s ^ for every 
s £ N. The random operator (|2.1j) is referred to as the hierarchical Ander- 
son Hamiltonian or the hierarchical Anderson model. We use the additional 
specification "homogeneous" if (|2.6[) holds. 

The basic spectral theorem for the self-adjoint hierarchical Laplacian A 

is 

Theorem 2.1. The spectral decomposition of the hierarchical Laplacian A 
reads 

00 

A = ^A r (E r -E. r+1 ), (2.9) 

r=0 
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where 

T 

X r := J2 Pa, re N , (2.10) 

s=0 

are its eigenvalues (of infinite multiplicity) and E r — E r+ i is the orthogonal 
projection onto the eigenspace corresponding to X r . In particular, these eigen- 
values and their accumulation point Xoo := 1 belong to the essential spectrum 
of A. 

We refer to |Krl[ Thm. 1.1] for a proof. The spectral value Aoo = 1 
is never an eigenvalue since p s > for every s G N. Any eigenfunction 
■i/v G I 2 (X) corresponding to the eigenvalue X r is constant on every cluster of 
rank r. In addition, the sum of these constants over all rank-r-clusters which 
belong to the same rank-(r + l)-cluster is always equal to zero, see Fig. [T]for 
a sketch. 

Using |Kr21 Lemma 1.2] together with ergodicity of H u , see Lemmas I A. II 
and El we conclude the standard 

Lemma 2.2. There exists a non-random compact subset £ C M such that 
spec-ff" = £ for P-a.e. usSl. The location of the deterministic spectrum £ 
obeys 

specA + suppPo CEC ( spec A + ch(suppP )) n ([0, 1] + suppP ), (2.11) 

where chB denotes the convex hull of a set B C M. In particular, we have 
sup(inf)£ = sup(inf){spec A + suppPo}, and i/suppPo is even connected, 
then also £ = spec A + supp Po . 

Remark 2.3. The preceding lemma strengthens Lemma 1.2 in |Kr2j . in as 
much as non-randomness of speciJ" is established irrespective of the con- 
nectedness of suppPo- 

For homogeneous hierarchical structures, we will focus on the special case 
where the decay rate of (p s ) s6 N is linked to the degree n of the structure. 

Definition 2.4. Consider a homogeneous hierarchical structure of degree n ^ 
2. Suppose that there exist constants C\, Ci > and p > 1 such that 

C xP ~ r ^p r ^ C 2 p- r (2.12) 

for all r G N large enough. Then the spectral dimension of this model is 
defined as 

d s = d s (n,p):=2 ] ^. (2.13) 
In p 

In other words, this amounts to Cin~ 2r l ds ^ p r ^ C^nT 2r l ds for large 
r. One motivation for the definition of d s will be given by Lemma 12.51 below. 
To this end we introduce the integrated density of states Nq : K — > [0, 1] of 
A, which is defined by 



E^N Q (E) := (S X0 ,X]-oo,e](A)S X0 ) 



(2.14) 
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for some xq G X. Here, Xb stands for the indicator function of a set B, (•, •) 
denotes the canonical scalar product of the Hilbert space £ 2 (X) and S x the 
canonical basis vector associated with x G X, i.e. 8 x {y) = for every y £ 
X \ {x} and S x (x) — 1. We remark that N is a right-continuous distribution 
function, which is normalised according to Nq(1) = 1. Moreover, it does not 
depend on the choice of xq . This can be seen from the more explicit expression 

i U = 1 L 

]Qr\ \Qr+l\J \Qr(E) 



*o(E)= E ( 7777 " 777^) = 1- 777^ — r> ( 2 - 15 ) 



r£N : \ r ^E 

which follows from the spectral representation (|2.9j) . The second equality in 
(|2.15l) makes only sense for E £ [0, 1[ with r{E) := max{r' G N : A r > ^ E}. 

Lemma 2.5. Lei A be the hierarchical Laplacian of a homogeneous hierarchical 
structure with spectral dimension d s . Then the integrated density of states Nq 
of A exhibits the upper-edge asymptotics 

]n[l-*o(l-£Q] = j, 
e\,o InE 2 v ' 

We refer to, e.g., |Krl[ Prop. 1.3] or |Mo2| for a proof. 

Next, we turn to the central quantity of this paper. The integrated den- 
sity of states N : K. — >■ [0, 1] of the hierarchical Anderson model is defined 

by 

E h-> N(E) :=E[(4 ,X]-oo,£](i? w )4 )], (2.17) 
where E denotes the probabilistic expectation associated with P. The inte- 
grated density of states is a right-continuous distribution function and inde- 
pendent of the choice of xo G X. The set of growth points of N coincides P-a.s. 
with the compact deterministic spectrum described in (|2.1ip . At the lower 
spectral edge inf supp Po the asymptotics of N is solely determined by the 
single-site distribution Po of the potential, because the second-lowest eigen- 
value Ai = pi > of the hierarchical Laplacian is separated from Ao = by a 
gap. The interesting case is the upper spectral edge 1 + sup supp Po , because 
the eigenvalues of the hierarchical Laplacian accumulate at sup spec A = 1. 
The main result of the present paper, Theorem 12.61 concerns this case. 

Theorem 2.6. Let N be the integrated density of states of a homogeneous 
hierarchical Anderson model with the properties (12. 3|) . (|2.4j) and (|2.12|> . Then. 

N has a Lif shits tail at the upper spectral edge 1 + v + in the sense that 

m|h[i-^(i + i, + -^| = _j, 

e\q InE 2 v ' 

We note that N(l + v+) = 1 and recall that d s denotes the spectral dimension 
(|2.13p of the homogeneous hierarchical model. 

The proof of Theorem 12.61 can be found in Sect. @] 



Remark 2.7. The intuition behind Theorem l2.6l is that eigenvalues 1 + v + — E 
close to the upper edge of the spectrum, i.e. for E < 1, must maximise both 
the kinetic and the potential energy. In order to achieve a large kinetic energy 
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of the order 1 — E =: 1— X^L re (.E) Prt the associated eigenfunction must be ap- 
proximately constant over a cluster of high rank k(E) ~ —d s (\nE)/(2 Inn) 3> 
1, that is, with volume \Q k (e) \ — n K ^ E ' ~ E~ da l 2 . To ensure a large potential 
energy at the same time then requires most coupling constants of this cluster 
to take values close to v + = supsuppPo- This is a large-deviation event with 
approximate probability exp{— |<3 k (,e)|} ~ exp{— E^ d ^ 2 } that sets the scale 
for N(l + v + — E). We refer to the end of Sect. [T]for further comments on 
the fact that the Lifshits exponent equals d s /2. 



3. Dirichlet-Neumann bracketing 

Technically, many proofs of Lifshits tails rely on Dirichlet-Neumann bracket- 
ing. This allows for a two-sided estimate of the integrated density of states 
in terms of finite- volume operators. We will also follow this route. Thus, it is 
a main point of this paper to find a pair of suitable finite- volume restrictions 
of the hierarchical Laplacian for which Dirichlet-Neumann bracketing works. 

Definition 3.1. For xq G X fixed and k G No we consider the finite clus- 
ter Q K = Q k (xq) and introduce the Neumann, resp. Dirichlet finite- volume 
restrictions 

K OO 

An,q k := £ A D ^ K := A n , Qk + £ Ps l|^ (3.1) 

S — 1 S — ft - ) - ! 

of the hierarchical Laplacian to the finite-dimensional subspace £ 2 (Q K ). For 
X G {N, D} we then set 

HI Qk := A x , Q)s + V". (3.2) 

To simplify notation we write Ax. K = Ax,q k and H£ K = H£ , if there is 
no danger of confusion. 

The desired property is stated in 

Lemma 3.2 (Dirichlet-Neumann decoupling). Consider a fixed finite cluster 
Q K of rank n G N. Let r G No, r < k, and assume that the cluster Q K is 
the union of m disjoint clusters Q\, . . . , Q™ of lower rank r. Writing H^ 3 r = 
iT x Qj for X G {N, D}, we have in the sense of quadratic forms 

rn m 

H$, K 0"v; ond #rU<0#r#' (3-3) 

3=1 3=1 

Proof. The subspace i 2 (Q 3 r ) is left invariant under E s |^q . for every s G 
{1, . . . , r} and every j G {1, . . . , m}. Thus, we have 

(rn \ k m 

©3#)+ £ ^L (0 ^® H ^ (3 - 4) 
j = l ' s=r+l (Wn) j=l 
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and 



"ri, 0//n:;) - E p-(i-b.) „,„ , < (3-5) 



3=1 7 s=r+l y ^ ' j=l 



a 



Being an operator on a finite dimensional space, K has discrete (random) 
eigenvalues 

ex,«(l)<e^(2)<...<e^ re (|Q re |) ) (3.6) 

which are counted according to their multiplicities. We define the correspond- 
ing normalised eigenvalue counting function N£ K : K — > [0, 1] by 

E^NZ K (E):=—J2 X] -oo,E]( e xAj))> Xe{N,D}. (3.7) 

In the macroscopic limit this quantity is self- aver aging and justifies the in- 
terpretation of N as an integrated density of states. 

Lemma 3.3. For X 6 {N, D} there exists a set i7o C ft of full probability, 
P[fio] = lj such that for every uj £ Qq we have 

hm N£ <K (E) = N(E) (3.8) 
ai each continuity point E of N . 

Remark 3.4. The above lemma extends Thm. 3.3 in |Kr3| to the Dirichlet 
case. Whereas we rely on ergodicity, the (longer) argument in |Kr3j follows a 
different route which is based on the law of large numbers instead. 



Proof of Lemma \3.3i Case X = N. Given Q C X, we write trQ for the trace 
over £ 2 (Q) and identify a function on Q with the corresponding multiplication 
operator by this function on £ 2 (Q). The covariance and ergodicity of the 
hierarchical Anderson model, Lemmas I A . 31 and I A . 2 1 yield for every continuous 
function ip G C C (R) with compact support 

' [Xq k{xq) <p(H«)} = Y (6 X0 ,<p(H^))5 X0 ) 



\Q*\ * i-wvxo, ^ /J \Q K \ ^ . 



\ / dP(o/) (5 X0 ,<f(H» )S X0 ) (3.9) 



for P-a.e. w € H. Next, we introduce the unrestricted but truncated operator 
K k '■= J2s=iPs E s + V " on ^ 2 (X). Since K% converges to H u in operator 
norm as k — > oo for P-a.e. w € 0, we conclude that also 

^tr x [ XQAxa) <p{K%)] ™ dP(a/) (8 Xo ,<p(H w ')8 Xo ) (3.10) 

for P-a.e. u £ fl and every given i/3 e C C (R). This implies P-a.s. vague con- 
vergence of the corresponding probability measures on R, see e.g. the proof 
of Thm. 5.5 in [Ki] , Hence, there exists a set J7q C ft of full probability, 
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P[Qo] = 1, such that for every u> S f2o the corresponding distribution func- 
tions converge 

N$JE) = -i-tr x [Xq^X^ooM^)] ™ N(E) (3.11) 

for every continuity point E £ R of N. We note that the left equality above 
relies on £ 2 (Q k (xq)) being an invariant subspace of K%. 

Case X = D. We repeat the argument of the Neumann case with 



K u 



S=K-\-l 



acting on ^ 2 (X). 



(3.12) 



□ 



Lemma 3.5 (Dirichlet-Neumann bracketing). For every cluster Q r of rank 
r £ Nj and for every E 6 K the integrated density of states N obeys the 
two-sided estimate 



E[N^ r (E)] ^N(E)^E[N^ r (E)]. 



(3.13) 



Proof. We fix E e M, r E Nq and xq € X. Consider a cluster Q K = Q k (xq) 
of rank k £ N, k > r, such that Q K is the union of m disjoint rank-r-clusters 
Ql, . . . , for some m S N. Using Lemma 13.21 and the min-max principle, 
we conclude that 



E 



1 



\Qk 



■ tT Q K X]-co,E](H: 



N.kJ 



^ E 



1 



m\Ql 



tr Q K 



(©X]-oo,*|(*#)) 



\ " 



m|Qi| ^ 



X]-oo,B] 



(3.14) 



Here, the last equality relies on the r-cluster permutation invariance of An,t- 
and the identical distribution of the random variables. By dominated conver- 
gence and Lemma [3.3[ the left-hand side of (|3.14p converges to ¥,[N(E)] = 
N(E) as k — y oo, provided £ is a continuity point of N . In this case we 
obtain N(E) ^ E[N$ r (E)]. If E happens to be a discontinuity point of TV - 
for which we have no convergence statement in Lemma 13.31 - simply replace 
E by a monotone decreasing sequence of continuity points Ei \, E and use 
right-continuity. 

The lower bound follows by the same line of reasoning. □ 



4. Proof of Theorem 12.61 

The strategy outlined in Remark 12 . 71 suggests to estimate the maximal eigen- 
value of random operators on finite clusters. For a general self-adjoint op- 
erator, an upper bound on the maximal eigenvalue is provided by Temple's 
inequality, which we recall from |RS1 Thm. XIII. 5] (with A replaced by —A). 
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Lemma 4.1. Let A be a self-adjoint operator in a Hilbert space and let 
E m ax(A) '■= sup spec A be an isolated eigenvalue of A. We write E\(A) := 
sup { spec(A) \ {E max (A)}} and assume the existence of a vector ip in the 
domain of A such that (tp,ip) = 1 and 

l^,A^)>E 1 {A). (4.1) 

Then we have the estimate 

E max (A) < <^> + { ^ A ^_ Ei{A) • (4-2) 



Proof of Theorem \2M We assume w.l.o.g. that v+ = 0. This can always be 
achieved by adding the constant term —v + l to H u . In order to prove the 
assertion, we will construct an upper and a lower bound on 1 — N(l — E), 
which, after taking double logarithms, will asymptotically coincide as E \ 0. 

In what follows we choose some fixed cluster Q K = Q K (xo), xo G X, of 
finite rank k £ N and let < E <C 1 be arbitrary but fixed. 

(a) Upper bound. We recall the notation ()3.6|) and estimate the ex- 
pectation of the finite- volume eigenvalue counting function N£ K {E') for every 
E' e R in terms of the maximal eigenvalue E ms , x (H^ K ) = eg K (\Q K \) of iJg K 

by 

AQ*\ 



E [N£ !K (E')] 



\Qk\ 

> E 



E 



X^ocE'] (E m UH^ K ))\ = F[E max (H^ K ) < E'] . (4.3) 

Setting E' = 1 — E and applying Lemma 13.51 we conclude that 

1 - N(l - E) < 1 - E [N£J1 - E)] < P[£ max (ff£, J > 1 - E] . (4.4) 

To proceed further, we need an upper bound on E max (H^ K )- Temple's 
inequality cannot be applied directly: even the normalised trial function 
■00 = \Qk\~ x ^ 2 £ £ 2 (Qk), which is the eigenfunction of Ad, k correspond- 
ing to the maximal eigenvalue -E m ax(AD, K ) = 1, will not satisfy the condition 
(|4.1[) for k large enough. This problem is circumvented by introducing the 
auxiliary operator 

H^ K := A D . K + V? > A D , K + V" = H^ K (4.5) 

on £ 2 (Q K ) with the new, higher potential V£(x) :— max^, — p K /3} ^ for 
all x G X. This operator satisfies 

{^o,H£,M >l-^>l-p K = E 1 (A^ K ) > E X (H^ K ) (4.6) 

and may thus be employed in Temple's inequality. In order to simplify the 
right-hand side of (|4.2j) . we use the estimates {tp , H^, K 4o) — Ei(Hg K ) ^ 
2p K /3 and (V^) 2 (p K /3)\V£\, and arrive at 



1 

J 



E max (H^ K ) < 1 + £ V t :(x). (4.7) 
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Together with ([4. 41) . this implies 
1-N(1-E) s£I 



\Q 



(4.8) 



By hypothesis of Theorem 12.61 the spectral dimension d s exists and the esti- 
mate (|2.12j) is valid. So we choose 

k = k(E) := max{r G N : \Q r \ ^ {aE)- d " /2 }, (4.9) 

where a > is a free parameter to be determined below. Since, by definition, 
n = p d =/ 2 for some p > 1 and since |Q r | = n r , the inequality in (|4.9p is 
equivalent to ^ Therefore, (|2.12l) guarantees that for E > small 
enough, we have the estimate Pk(E) > C\p^ k ^ ^ C\Ea with some constant 
Ci > 0. This estimate and (|4~8]l imply 



l-N{l-E)^ 



1 



> 



=:F[A k{E) ] (4.10) 



for all E > small enough. 

For every r£N, and assuming a > 6/Ci, it follows that the condition 



1 



\Qr 



{x6Q r :7;(i)>-|} 



> 1 - 



6 



ad 



=: z > 



is necessary for the event A r to occur. Hence 

1 



\Ar\ K 



\Qr\ 



{xeQ r :V?(x)>-£-} 



(4.11) 



(4.12) 



To eliminate the dependence of this probability on p r , we pick 7 G 
] inf supp Po , 0[, introduce the i.i.d. random variables 77^ := X] 7l o]( w a;) f° r 
i£X and note that 

{z 6 Q r : > -^}| = \{x E Q r : uj x > -^}| < ]T ( 413 ) 



xeQ r 



where we assumed 7 ^ —p r /3 in the last step. This implies 

PfA-1 < 



sec 



(4.14) 



for all r G N large enough, since (p r ) r is a null sequence. The latter probability 
can be handled with the standard estimate P(X ^ 8) ^ e~ tS K(e tx ) for any 
£ ^ and any random variable X. Thus there exists ro G N such that for 
every r G N with r ^ ro we have 

P[A] < e-'^l E[e tS W£] = e -* 2 l^l (E [e*<o ]V° rl = e -l«-l/W 

(4.15) 

for any £ 0, where /(t) := — lnEo[e t,?x o] and Eo is the expectation 
associated with the single-site distribution Po. In view of (|4.10j) this means 
that there exists E u > such that for every E G]0, E u ] the estimate 

1- N(l- E) ^e^ Qk ^ lfW (4.16) 
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holds for a\\t ^ 0. 

Now we choose 7 close enough to such that q := E (ry^) = P0Q7, 0] ) £ 
]0, 1[. This is always possible in view of conditions (I2.3[) and (12.41) . By adjust- 
ing the free parameter a > 6/C1 large enough, we ensure in addition that 
q < z. Then we have /(0) = and 



f(0) = z- ° [/ *° 

E [ e '-0 



= Z - q > 0. 



(4.17) 



t=o 



Hence, there is a to > such that /(to) > 0. We remark that neither to nor 
f(to) depend on E. Definition (|4.9I) implies 



|Qfc(K)| = ~\Qk(E)+i\ > ~(aE)- d °/ 2 (4.18) 
n n 

for the homogeneous model. This estimate and (|4.16j) then yield the desired 
upper bound 

1- N(l-E) <exp{ -C u E- d */ 2 } (4.19) 

for all E e]0, E u ] with the constant C u := f(t ) a~ d ^ 2 /n > 0. 

(b) Lower bound. This time we use the upper bound of Lemma 
and estimate 

1-N(l-E)>l--E[N$, K (1-Ej\ 
1 



\Qk\ 

1 

IQ^I 



E 



{eigenvalues of H£ K > 1 — E}\ 
[E max (H£ tK ) > 1 - E] , 



(4.20) 



where E max (H$ <K ) = e^ K {\Q K \) = sup 0¥:lpett(QK) (tp,H^ iK <p}/(<p,(p) denotes 

the maximal eigenvalue of Hfi . The choice ip = -0o = |Q K |~ 1//2 for the trial 
function yields 



1 

—1 



Therefore we get together with (|4.20[) 

1 _r 1 ^ 



l-N(l-E)^ 



\Qk\ 



\Qk\ 



> -E 



S=K-\-l 



So far, k^N was fixed arbitrary. Now we choose 

K = K(E) := min |r G N : p s < — j 

and conclude 



s=r+l 



l-iV(l-J5) > 



1 v ^ 



(4.21) 



(4.22) 



(4.23) 



(4.24) 
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Since the random variables are assumed to be i.i.d., we obtain 



1 



\Qk(i 



E 



xeQt 



E 
~2 



Vie Qk(e) '■ u x > 

f \ \Qk(e\ 

= c -a(E)\QK ( E)\ 



(4.25) 



with g(E) := —\ijPq[uj Xo > —E/2] for every E > 0. The function g is well- 
defined because of Assumption (|2.4[) on the tails of the single-site distribution. 
In fact (|2.4p implies the estimate 



O^g(E) < m|ln(S/2)| -InC (4.26) 

for all sufficiently small £7 > with some ^-independent constants C, m > 0. 

We have by definition (|4.23l) that Y^=K(E) P s ^ E/2. In conjunction 
with the upper bound in (|2.12j) . this yields the existence of Ei > such that 
p K(E) ^ c 2/d BE -! for aU E e ] £ ; ; ] with thc constant Ci := [2C 2 p/(p- 

l)] rfs / 2 . Since p ds / 2 = n, we conclude 

|Qx-(B)| <QE- rf «/ 2 (4.27) 

for all E g]0, £?/]. Collecting (j4"2"4"]) . (j4~23T) and fOT]) . we finally arrive at the 
desired lower bound 

1 - N(l - E) ^ C^E d "' 2 exp{-CiE- d °/ 2 g(E)} (4.28) 

for all E e]0,Ei]. 

(c) Limit E \ 0. The lower bound ()4.28j) can be simplified by 
observing (|4. 26(1 . enlarging the constant Ci and diminishing Ef. there exist 
constants Ci,Ei > such that 

1-N(1-E)^ Cf 1 exp^C/S^/ 2 ! ln£7|} 
for all E g]Q, E{\. This clearly implies 



(4.29) 



lim inf 

E\0 



In | In [l - N(l - E)] 
lmB 



y 



On the other hand, we deduce from the upper bound (|4.19j) that 



lim sup 

E\0 



In | In [l - N(l - E)} 



\nE 



^ 2 ' 



and Theorem 



is proven. 



(4.30) 

(4.31) 
□ 



Appendix A. Ergodicity 

Here we briefly compile the ergodic structure of the hierarchical Anderson 
model, which we have not found in the literature. Due to the use of ergodic- 
ity, our Lemmas 12.21 and 13.31 give slightly stronger results, resp. have shorter 
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proofs, than corresponding statements in |Krl[ lKr2l IKr3j . who argued with- 
out it. 

In what follows, we enumerate X as in ()2.5|) and arrange the elements 
in increasing order from left to right, see Fig. [T] Also, we think of X as being 
isomorphic to the space 

X = ((?r)reN : fr G {0, 1, . . . , n r - 1} for all r G N and ^ £ r < ooj, (A.l) 
^ r=l ' 

which consists of sequences with only finitely many non-zero elements. The 
identification x = (£r)reN, which underlies (|A.1[) . works as follows: £1 deter- 
mines the position of x in Qi(x), where £1 = corresponds to the left-most 
position in Fig.[TJ £1 = 1 to the second position from the left, and so on. Sim- 
ilarly, £ r encodes the position of Q r ^.i(x) in Q r (x) for every r 2, where, 
again, cluster positions are counted from the left, starting with 0. For exam- 
ple, in Fig. [T] we have X5 = (2, 1, 0, 0, . . .). Every id eventually belongs to 
Qr (xo), the left- most cluster of rank tq, for some sufficiently large rg. There- 
fore any sequence (£r)reN has only finitely many non-zero elements. Moreover, 
we have the representation Xk — (^r)reti, if and only if k = £r|Qr— 1|- 

The countable space X can be equipped with an Abelian group struc- 
ture, which we write in an additive way 

x + V ■= ((&■ + Vr) mod n r ) r£N (A.2) 

for all x = (Cr)reN and y = (?7r)reN in X. The identity of the group is given 
by Xq and the inverse of x by — x :— (n r — £ r )reN- 

The discrete Abelian group X acts on il according to X x ft — > f2, 
(x,oj) M> t x (u)), where 

t x (cj) := (u> x+ y) ye x (A. 3) 

for every i£l and every ui := (u) y ) y <=x G ft. Since P = (^^ex^O) every t x is 
measure-preserving, and we have 

Lemma A.l. The group of transformations {t x } x ^x is ergodic with respect to 
P. 

Proof. Let A G ® xe x ^ e invariant under t x for every x G X. We show 
¥[A] is either or 1. Let e > be given. Since every product- measurable 
set can be approximated arbitrarily well by cylinder sets, there exists k G N 
and Z = X ieX Z x with Z x G Z3r for every x G X and Z x = R for every 
x G {xfe 6X:fc) k}, and such that 

PL4AZ]<e. (A.4) 

Here A denotes the symmetric difference. Pick ro G N such that x K G Qr-ot^o) 
and define w := (<5 r . ro +i) r eN G X. Hence, (w + Q ro (xo)) nQ ro (xo) = 0, which 
in turn implies the crucial identity 

P[t w (Z) nz]= P[t w (Z)] P[Z] = (P[Z}) 2 (A.5) 
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because t w (Z) = X xeX Z w+X . Using invariance of A under r w and (|A.5|i . we 
conclude 

s? F[A] - {P[A}) 2 = ¥[A n t w (A)] - P[Zn t w (Z)] + (P[Z]) 2 - (P[A\) 2 
^ ¥[(Ar\T w (A)) a {Zr\T w {Z))\ +2P[AaZ] 
^P[(AaZ)U (T ffi (i)Ar„(2))] + 2PL4AZ] 

4e. (A.6) 

In the second line of |Q) we estimated P[C] - ¥[D] ^ F[C A D] for events 
C and D, which follows from C* = (C\D) U (C D D) C (CaD)UD. The 
inequality in the third line of (|A.6[) is based upon the inclusion 

(C 1 nD 1 )\(c 2 nD 2 ) = (tf x nCSni?i)u(i>iaD§nCi) c (Ci\c 2 )u(z? 1 \ j d 2 ) 

and its mirror 1 < — >• 2. In order to get to the last line of (|A.6j) we used 
t w (C) At w (D) = t w (C&D), the fact that t w is measure preserving and 
(|A.4|) . Since e > is arbitrary in (IA.6I) , this completes the proof. □ 

The (F0lner) sequence of growing clusters (Qr(xo)) N exhausts the 
(amenable) ergodic group X and fulfils Shulman's temperedness condition 
[Ll Def. 1.1]. Therefore we can apply the general pointwise ergodic theorem 
of Lindenstrauss [Lj Thm. 1.2] and conclude 

Lemma A. 2 (Birkhoff ergodic theorem). For every F-integrable random vari- 
able h : — > C we have 

J&Wi E Hr m { U ))=E[h] (A.7) 
for F-almost every to E Q. 

The reason for introducing the particular group structure (|A.2[) is its com- 
patibility with the hierarchical structure on X. This underlies 



Lemma A. 3. The hierarchical Anderson Hamiltonian (|2.1j) transforms co- 
variantly under the group action on fl, 

jjt.M = U*H U U X (A.8) 

for every x E X and every u) E Q. Here we have introduced the unitary 
representation of the group X on £ 2 (X), given by {U x ip)(y) := ip(y — x) for 
all x,y E X and all ip E ^? 2 (X). 

Proof. Clearly we have 

(U*V"U x ip)(y) = (V"U x i>)(x + y) = cj x+y (U x yj)(x + y) 

= (r x (uj))j(y) = (V^^)(y), (A.9) 
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so that it remains to verify the invariance of the hierarchical Laplacian 
{U*MJ x ^){y) = Y^p.&.UMx + y) = J2P°]nl E (^)(«) 

oo 1 

= E^joi E ^(v-x) 

s=i «ex+Q.(v) 

OO 

= E^ToT E ^H = (A^)(»). (A.IO) 

The equality in the second line of (|A.10j) rests on the identity 

Q s {x + y) = |(Cr)reN : Cr G {0, . . • , n r - 1} for r < s, 

Cr = (£r + ?/r ) mod n r for r > s j 
= a; + Q s ( 2 ,) (A.ll) 
for all s G N, x = (£ r )reN and y = (r] r ) re f$ in X. □ 
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